539 research outputs found

    The BRENDA Tissue Ontology (BTO): the first all-integrating ontology of all organisms for enzyme sources

    Get PDF
    BTO, the BRENDA Tissue Ontology (http://www.BTO.brenda-enzymes.org) represents a comprehensive structured encyclopedia of tissue terms. The project started in 2003 to create a connection between the enzyme data collection of the BRENDA enzyme database and a structured network of source tissues and cell types. Currently, BTO contains more than 4600 different anatomical structures, tissues, cell types and cell lines, classified under generic categories corresponding to the rules and formats of the Gene Ontology Consortium and organized as a directed acyclic graph (DAG). Most of the terms are endowed with comments on their derivation or definitions. The content of the ontology is constantly curated with ∼1000 new terms each year. Four different types of relationships between the terms are implemented. A versatile web interface with several search and navigation functionalities allows convenient online access to the BTO and to the enzymes isolated from the tissues. Important areas of applications of the BTO terms are the detection of enzymes in tissues and the provision of a solid basis for text-mining approaches in this field. It is widely used by lab scientists, curators of genomic and biochemical databases and bioinformaticians. The BTO is freely available at http://www.obofoundry.org

    BRENDA, the enzyme information system in 2011

    Get PDF
    The BRENDA (BRaunschweig ENzyme Database, http://www.brenda-enzymes.org) enzyme information system is the main collection of enzyme functional and property data for the scientific community. The majority of the data are manually extracted from the primary literature. The content covers information on function, structure, occurrence, preparation and application of enzymes as well as properties of mutants and engineered variants. The number of manually annotated references increased by 30% to more than 100 000, the number of ligand structures by 45% to almost 100 000. New query, analysis and data management tools were implemented to improve data processing, data presentation, data input and data access. BRENDA now provides new viewing options such as the display of the statistics of functional parameters and the 3D view of protein sequence and structure features. Furthermore a ligand summary shows comprehensive information on the BRENDA ligands. The enzymes are linked to their respective pathways and can be viewed in pathway maps. The disease text mining part is strongly enhanced. It is possible to submit new, not yet classified enzymes to BRENDA, which then are reviewed and classified by the International Union of Biochemistry and Molecular Biology. A new SBML output format of BRENDA kinetic data allows the construction of organism-specific metabolic models

    Automatic Assignment of EC Numbers

    Get PDF
    A wide range of research areas in molecular biology and medical biochemistry require a reliable enzyme classification system, e.g., drug design, metabolic network reconstruction and system biology. When research scientists in the above mentioned areas wish to unambiguously refer to an enzyme and its function, the EC number introduced by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB) is used. However, each and every one of these applications is critically dependent upon the consistency and reliability of the underlying data for success. We have developed tools for the validation of the EC number classification scheme. In this paper, we present validated data of 3788 enzymatic reactions including 229 sub-subclasses of the EC classification system. Over 80% agreement was found between our assignment and the EC classification. For 61 (i.e., only 2.5%) reactions we found that their assignment was inconsistent with the rules of the nomenclature committee; they have to be transferred to other sub-subclasses. We demonstrate that our validation results can be used to initiate corrections and improvements to the EC number classification scheme

    Characterization of growth and metabolism of the haloalkaliphile Natronomonas pharaonis

    Get PDF
    Natronomonas pharaonis is an archaeon adapted to two extreme conditions: high salt concentration and alkaline pH. It has become one of the model organisms for the study of extremophilic life. Here, we present a genome-scale, manually curated metabolic reconstruction for the microorganism. The reconstruction itself represents a knowledge base of the haloalkaliphile's metabolism and, as such, would greatly assist further investigations on archaeal pathways. In addition, we experimentally determined several parameters relevant to growth, including a characterization of the biomass composition and a quantification of carbon and oxygen consumption. Using the metabolic reconstruction and the experimental data, we formulated a constraints-based model which we used to analyze the behavior of the archaeon when grown on a single carbon source. Results of the analysis include the finding that Natronomonas pharaonis, when grown aerobically on acetate, uses a carbon to oxygen consumption ratio that is theoretically near-optimal with respect to growth and energy production. This supports the hypothesis that, under simple conditions, the microorganism optimizes its metabolism with respect to the two objectives. We also found that the archaeon has a very low carbon efficiency of only about 35%. This inefficiency is probably due to a very low P/O ratio as well as to the other difficulties posed by its extreme environment

    ORENZA: a web resource for studying ORphan ENZyme activities

    Get PDF
    BACKGROUND: Despite the current availability of several hundreds of thousands of amino acid sequences, more than 36% of the enzyme activities (EC numbers) defined by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) are not associated with any amino acid sequence in major public databases. This wide gap separating knowledge of biochemical function and sequence information is found for nearly all classes of enzymes. Thus, there is an urgent need to explore these sequence-less EC numbers, in order to progressively close this gap. DESCRIPTION: We designed ORENZA, a PostgreSQL database of ORphan ENZyme Activities, to collate information about the EC numbers defined by the NC-IUBMB with specific emphasis on orphan enzyme activities. Complete lists of all EC numbers and of orphan EC numbers are available and will be periodically updated. ORENZA allows one to browse the complete list of EC numbers or the subset associated with orphan enzymes or to query a specific EC number, an enzyme name or a species name for those interested in particular organisms. It is possible to search ORENZA for the different biochemical properties of the defined enzymes, the metabolic pathways in which they participate, the taxonomic data of the organisms whose genomes encode them, and many other features. The association of an enzyme activity with an amino acid sequence is clearly underlined, making it easy to identify at once the orphan enzyme activities. Interactive publishing of suggestions by the community would provide expert evidence for re-annotation of orphan EC numbers in public databases. CONCLUSION: ORENZA is a Web resource designed to progressively bridge the unwanted gap between function (enzyme activities) and sequence (dataset present in public databases). ORENZA should increase interactions between communities of biochemists and of genomicists. This is expected to reduce the number of orphan enzyme activities by allocating gene sequences to the relevant enzymes

    METANNOGEN: compiling features of biochemical reactions needed for the reconstruction of metabolic networks

    Get PDF
    BACKGROUND: One central goal of computational systems biology is the mathematical modelling of complex metabolic reaction networks. The first and most time-consuming step in the development of such models consists in the stoichiometric reconstruction of the network, i. e. compilation of all metabolites, reactions and transport processes relevant to the considered network and their assignment to the various cellular compartments. Therefore an information system is required to collect and manage data from different databases and scientific literature in order to generate a metabolic network of biochemical reactions that can be subjected to further computational analyses. RESULTS: The computer program METANNOGEN facilitates the reconstruction of metabolic networks. It uses the well-known database of biochemical reactions KEGG of biochemical reactions as primary information source from which biochemical reactions relevant to the considered network can be selected, edited and stored in a separate, user-defined database. Reactions not contained in KEGG can be entered manually into the system. To aid the decision whether or not a reaction selected from KEGG belongs to the considered network METANNOGEN contains information of SWISSPROT and ENSEMBL and provides Web links to a number of important information sources like METACYC, BRENDA, NIST, and REACTOME. If a reaction is reported to occur in more than one cellular compartment, a corresponding number of reactions is generated each referring to one specific compartment. Transport processes of metabolites are entered like chemical reactions where reactants and products have different compartment attributes. The list of compartmentalized biochemical reactions and membrane transport processes compiled by means of METANNOGEN can be exported as an SBML file for further computational analysis. METANNOGEN is highly customizable with respect to the content of the SBML output file, additional data-fields, the graphical input form, highlighting of project specific search terms and dynamically generated Web-links. CONCLUSION: METANNOGEN is a flexible tool to manage information for the design of metabolic networks. The program requires Java Runtime Environment 1.4 or higher and about 100 MB of free RAM and about 200 MB of free HD space. It does not require installation and can be directly Java-webstarted from

    A theoretical model for template-free synthesis of long DNA sequence

    Get PDF
    This theoretical scheme is intended to formulate a potential method for high fidelity synthesis of Nucleic Acid molecules towards a few thousand bases using an enzyme system. Terminal Deoxyribonucleotidyl Transferase, which adds a nucleotide to the 3′OH end of a Nucleic Acid molecule, may be used in combination with a controlled method for nucleotide addition and degradation, to synthesize a predefined Nucleic Acid sequence. A pH control system is suggested to regulate the sequential activity switching of different enzymes in the synthetic scheme. Current practice of synthetic biology is cumbersome, expensive and often error prone owing to the dependence on the ligation of short oligonucleotides to fabricate functional genetic parts. The projected scheme is likely to render synthetic genomics appreciably convenient and economic by providing longer DNA molecules to start with

    Intuitive Visualization and Analysis of Multi-Omics Data and Application to Escherichia coli Carbon Metabolism

    Get PDF
    Combinations of ‘omics’ investigations (i.e, transcriptomic, proteomic, metabolomic and/or fluxomic) are increasingly applied to get comprehensive understanding of biological systems. Because the latter are organized as complex networks of molecular and functional interactions, the intuitive interpretation of multi-omics datasets is difficult. Here we describe a simple strategy to visualize and analyze multi-omics data. Graphical representations of complex biological networks can be generated using Cytoscape where all molecular and functional components could be explicitly represented using a set of dedicated symbols. This representation can be used i) to compile all biologically-relevant information regarding the network through web link association, and ii) to map the network components with multi-omics data. A Cytoscape plugin was developed to increase the possibilities of both multi-omic data representation and interpretation. This plugin allowed different adjustable colour scales to be applied to the various omics data and performed the automatic extraction and visualization of the most significant changes in the datasets. For illustration purpose, the approach was applied to the central carbon metabolism of Escherichia coli. The obtained network contained 774 components and 1232 interactions, highlighting the complexity of bacterial multi-level regulations. The structured representation of this network represents a valuable resource for systemic studies of E. coli, as illustrated from the application to multi-omics data. Some current issues in network representation are discussed on the basis of this work

    Differential Interactions of the Autonomous Pathway RRM Proteins and Chromatin Regulators in the Silencing of Arabidopsis Targets

    Get PDF
    We have recently shown that two proteins containing RRM-type RNA-binding domains, FCA and FPA, originally identified through their role in flowering time control in Arabidopsis, silence transposons and other repeated sequences in the Arabidopsis genome. In flowering control, FCA and FPA function in the autonomous pathway with conserved chromatin regulators, the histone demethylase FLD and the MSI1-homologue FVE, a conserved WD-repeat protein found in many chromatin complexes. Here, we investigate how the RRM proteins interact genetically with these chromatin regulators at a range of loci in the Arabidopsis genome. We also investigate their interaction with the DNA methylation pathway. In several cases the RRM protein activity at least partially required a chromatin regulator to effect silencing. However, the interactions of the autonomous pathway components differed at each target analysed, most likely determined by certain properties of the target loci and/or other silencing pathways. We speculate that the RNA-binding proteins FCA and FPA function as part of a transcriptome surveillance mechanism linking RNA recognition with chromatin silencing mechanisms

    Quantitative global studies of reactomes and metabolomes using a vectorial representation of reactions and chemical compounds

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Global studies of the protein repertories of organisms are providing important information on the characteristics of the protein space. Many of these studies entail classification of the protein repertory on the basis of structure and/or sequence similarities. The situation is different for metabolism. Because there is no good way of measuring similarities between chemical reactions, there is a barrier to the development of global classifications of "metabolic space" and subsequent studies comparable to those done for protein sequences and structures.</p> <p>Results</p> <p>In this work, we propose a vectorial representation of chemical reactions, which allows them to be compared and classified. In this representation, chemical compounds, reactions and pathways may be represented in the same vectorial space. We show that the representation of chemical compounds reflects their physicochemical properties and can be used for predictive purposes. We use the vectorial representations of reactions to perform a global classification of the reactome of the model organism <it>E. coli</it>.</p> <p>Conclusions</p> <p>We show that this unsupervised clustering results in groups of enzymes more coherent in biological terms than equivalent groupings obtained from the EC hierarchy. This hierarchical clustering produces an optimal set of 21 groups which we analyzed for their biological meaning.</p
    corecore